Improving Discretization by Post- Processing Procedure

نویسندگان

  • Taijun Han
  • Sangbum Lee
  • Sejong Oh
چکیده

Abstact. Bioinformatics and data mining require data analysis schemes. Many methods of analysis, such as those focusing on entropy, have been developed and assume that the input data has discrete values. Therefore, when using continuous data, discretization needs to be performed before analysis can begin. Many discretization algorithms have been proposed, and these discretize a given dataset attribute-byattribute. Although such methods assume that the attributes are independent from each other, in reality these attributes interact with and influence the results of the analysis as a group, not individually. In this paper we propose a post-processing method that can improve the quality of discretization. After the normal discretization process, we adjust the boundary point of the discretization for each attribute, and then after evaluating the group effect of the adjusted point, we update the original boundary point by adjusting it if it has a positive influence on the attribute. The results of the empirical experiments show that the adjusted dataset improves the classification accuracy. Proposed method can be used with any discretization algorithms, and improves their discretization power.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Effect of Combined Cognitive Rehabilitation Interventions (Computer and Manual) on Improving the Information Processing Speed and Psychological Status of Women with MS

The purpose of this study was to investigate the effect of combined cognitive rehabilitation interventions (computer and manual) on improving the speed of information processing and improving the mental status variables (MMSE) in women with MS. This research was conducted with pre-test and post-test design with two month follow-up. Twenty two women with MS were selected by available sampling me...

متن کامل

A perturbation-method-based post-processing for the planewave discretization of Kohn-Sham models

In this article, we propose a post-processing of the planewave solution of the Kohn–Sham LDA model with pseudopotentials. This post-processing is based upon the fact that the exact solution can be interpreted as a perturbation of the approximate solution, allowing us to compute corrections for both the eigenfunctions and the eigenvalues of the problem in order to increase the accuracy. Indeed, ...

متن کامل

Coupling discontinuous Galerkin methods and retarded potentials for transient wave propagation on unbounded domains

This work deals with the numerical simulation of wave propagation on unbounded domains with localized heterogeneities. To do so, we propose to combine a discretization based on a discontinuous Galerkin method in space and explicit finite differences in time on the regions containing heterogeneities with the retarded potential method to account the unbounded nature of the computational domain. T...

متن کامل

An Anisotropic Error Estimator for the Crank--Nicolson Method: Application to a Parabolic Problem

In this paper we derive two a posteriori upper bounds for the heat equation. A continuous, piecewise linear finite element discretization in space and the Crank-Nicolson method for the time discretization are used. The error due to the space discretization is derived using anisotropic interpolation estimates and a post-processing procedure. The error due to the time discretization is obtained u...

متن کامل

Selective prosodic post-processing for improving recognition of French telephone numbers

This study describes a selective prosodic postprocessing procedure for improving the recognition of telephone numbers in French. The aim of the post-processing procedure is to recover recognition errors made by an HMM based ASR system. Instead of a global post-processing, this paper proposes a selective one. Post-processing is carried out only on some recognised numbers and only if its associat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015